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Question 4 
Intent of Question 


The primary goal of this question was to assess a student’s ability to identify, set up, perform, and interpret 
the results of an appropriate hypothesis test to address a particular question. More specific goals were to 
assess a student’s ability to (1) state appropriate hypotheses; (2) identify the appropriate statistical test 
procedure and check appropriate conditions for inference; (3) calculate the appropriate test statistic and 
p-value; and (4) draw an appropriate conclusion, with justification, in the context of the study. 


Solution 


step 1: States a correct pair of hypotheses. 


Let Dasy represent the population proportion of adults similar to those in the study who would have 


developed colon cancer within the six years of the study if they had taken a low-dose aspirin each day. 
similarly, let p,},, represent the population proportion of adults similar to those in the study who 


would have developed colon cancer within the six years of the study if they had taken a placebo each 
day. 


The hypotheses to be tested are Ho : Dasyn = Pplac VETSUS Hag : Dasp < Pplag OF equivalently, 
Hy Dasy —Daige =) Versus. Hy.2 Di. Diige <¥: 

Step 2: Identifies a correct test procedure (by name or by formula) and checks appropriate conditions. 
The appropriate procedure is a two-sample z-test for comparing proportions. 
Because this is a randomized experiment, the first condition is that the volunteers were randomly 
assigned to one treatment group or the other. The condition is satisfied because we are told that the 
volunteers were randomly assigned to take a low-dose aspirin or a placebo. 
The second condition is that the sample sizes are large, relative to the proportions involved. ‘he 
condition is satisfied because all sample counts are large enough; that is, 15 with colon cancer in 


aspirin group, 26 with colon cancer in placebo group, 500 — 15 = 485 cancer-free in aspirin group, and 
500 — 26 = 474 cancer-free in placebo group. 


step 3: Calculates the appropriate test statistic and p-value. 


15 26 


The sample proportions who developed colon cancer are Dis = 509 > 0.030 and Dee = 509 > O:052: 
. ah 15 + 26 
The combined sample proportion who developed colon cancer is Doombined = Gan eaw = 0.041. 
500 + 500 
The test statisticis z = ———— ~ —1.75 (-1.7542 from calculator). 
0.041(1 — 0.040 ( ‘ aa 
The p-value is P(Z < —1.75) = 0.0401 (0.0397 from calculator), where Z has a standard normal 


distribution. 
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Question 4 (continued) 
step 4: States a correct conclusion in the context of the study, using the result of the statistical test. 


Because the p-value is less than the given significance level of a = 0.05, we reject the null hypothesis. 
The data provide convincing statistical evidence that the proportion of all adults similar to the 
volunteers who would develop colon cancer if they had taken a low-dose aspirin every day is less than 
the proportion of all adults similar to the volunteers who would develop colon cancer if they had not 
taken a low-dose aspirin every day. 


Scoring 


Steps 1, 2, 3, and 4 are scored as essentially correct (E), partially correct (P), or incorrect (J). 
Step 1 is scored as follows: 


Essentially correct (E) if the response identifies correct parameters AND both hypotheses are labeled 
and state the correct relationship between the parameters. 


Partially correct (P) if the response identifies correct parameters OR states correct relationships, but not 
both. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Note: Hither defining the parameters in context, or simply using common parameter notation with 
subscripts clearly relevant to the context, such as p,,, and Dpjgc, 1s sufficient. 


Step 2 is scored as follows: 


Essentially correct (E) if the response correctly includes the following three components: 
1. Identifies the correct test procedure (by name or by formula). 
2. Notes that the use of random assignment satisfies the randomness condition. 
3. Checks for approximate normality of the test statistic by citing that all four counts are larger 
than some standard criterion such as 5 or 10. 


Partially correct (P) if the response correctly includes only two of the three components. 
Incorrect (I) if the response correctly includes at most one of the three components. 


Notes: 

e For the randomness component, it is (minimally) acceptable to say “random assignment — 
check” but not acceptable to say “random — check” or “SRS — check.” ‘The important concept 
here is that it is random assignment, and not random sampling, that is required. If the response 
implies that the study used a random sample, the randomness component is not satisfied, 
regardless of whether random assignment is correctly addressed. 

e ‘The normality check may use the expected counts under the null hypothesis in place of 
observed counts. 
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Question 4 (continued) 
Step 3 is scored as follows: 


Essentially correct (E) if the response correctly calculates both the test statistic and a p-value that is 
consistent with the stated alternative hypothesis. 


Partially correct (P) if the response correctly calculates the test statistic but not the p-value; 
OR 
if the response calculates the test statistic incorrectly but then calculates the correct p-value for the 
computed test statistic; 
OR 
if the response reports the correct p-value but no calculations or test statistic are shown. 


Incorrect (I) if the response fails to meet the criteria for E or P. 


Note: The p-value is considered correct if it is consistent with the alternative stated in the response 
and the calculated test statistic, even if those are incorrect. 


Step 4 is scored as follows: 


Essentially correct (E) if the response provides a correct conclusion in context, with justification based 
on linkage between the p-value and the given @ = 0.05. 


Partially correct (P) if the response provides a correct conclusion, with linkage to the p-value, but not in 
context; 

OR 
if the response provides a correct conclusion in context, but without justification based on linkage to 
the p-value. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 

e ‘The conclusion must be related to the alternative hypothesis. 

e Ifthe p-value is incorrect, then step 4 is scored as E if the response includes proper linkage and 
a conclusion in context consistent with that p-value. 

e Ifthe p-value is less than 0.05, wording that states or implies that the alternative hypothesis is 
proven lowers the score one level (that is, from E to P or P to I) in step 4. 

e Ifthe p-value is incorrect and greater than 0.05, wording that states or implies that the null 
hypothesis is accepted lowers the score one level (that is, from E to P or P to I) in step 4. 
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Question 4 (continued) 


Fach essentially correct (E) step counts as 1 point. Each partially correct (P) step counts as % point. 


4 


3 


2 


1 


Complete Response 
Substantial Response 
Developing Response 


Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to decide whether to 
score up or down, depending on the overall strength of the response and communication. 


© 2015 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


aT “Al YA 


4. A researcher conducted a medical study to investigate whether taking a low-dose aspirin reduces the chance 
of developing colon cancer. As part of the study, 1,000 adult volunteers were randomly assigned to one of two 
“> groups. Half of the volunteers were assigned to the experimental group that took a Awzdose Spin each day, 
ae J and the other half were assigned to the control group that took a placebo each day. At the end of six years, 
2) 15 of the pedple Who took the low-dose aspirin had developed colomcancer and_26 of the people who took 
“the placebo had developed colon cancer. At the significance level a = 0.05, do the data provide convincing 
statistical evidence that taking a low-dose aspirin each day would reduce the chance of developing colon cancer 


among all people meng % the volunteers? 0 | 
cee do ueloy (olan Cli ct ae ‘in (WW) any ii € WAM 


pl Ron i pony ie Who vn - fad, 
) Prat ray H penal ud Aublep Clu CAN yr B Paar eal - hs 
Pa ~7 ty 03 cys 


2)tto: Pre Ce Ata: P. <P, - 969 
) 4 Went Yandel Q Sig iu (c*. ee Ue 
; (25°00) > Np eS NaC P25 yor nd 
ab.5.69 MaS25 20525 THSES I agus 

2 (SuiW WL inde Genagiyt s C CA ples 


A) ropa 248 





p= . 057.024 
7 TAS NEED ), oak 4s) 
\J XO 560 


@)Z2= ~ 1.754 


o™< 


8) p valve > oat 


| oe 0 0 } O( PG {le 

J) SiN P< ad, WETS | | 
(O)TUVC (6 GyARU QMA oWOL {CALS rai ng Pree 
aspiniy goth ‘day wo pu hug she MING OF my 
DION QUY detov\y ai peogte Stmnilay to the Voluiiee 


may Delt Eee page. Pee. GO ON TO THE NEXT PAGE. 


-11- 


©2015 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


7 4B/ 
4. A researcher conducted a medical study to investigate whether taking a low-dose aspirin reduces the chance 
of developing colon cancer. As part of the study, 1,000 adult volunteers were randomly assigned to one of two 
groups. Half of the volunteers were assigned to the experimental group that took a low-dose aspirin each day, 
and the other half were assigned to the control group that took a placebo each day. At the end of six years, 
15 of the people who took the low-dose aspirin had developed colon cancer and 26 of the people who took 
the placebo had developed colon cancer. At the significance level a = 0.05, do the data provide convincing 
Statistical evidence that taking a low-dose aspirin each day would reduce the chance of developing colon cancer 
among all people similar to the volunteers? 


= Frue Proegortian oF sein Who took mediciqa and aet calor, Conce”, 


% fete ?, ryt? Ov« Dut Tr .Q{ +e Occ Orie WIC heck saute Gad qer COLON cances , 


a * .05 lo: 0,* °, lo: P,¢ Po 


LE cond tions are met we wo A CASe A Lsamole 2 reSs+ For PCOPor +\0NS . 


oad 


ondom = 100 adult volunteers weve (Qadomly o&sianed to one of two QICUDS . 
tit 

loormol = 26 ,4YT4 16 , UBS are all 210 va 

Tack pendent - Were Are at least (000 "lO = 10000 pecple Sitmlor to voluricersN 


Ploy (\ 


P= 0394 


Jo 





2: \842- 
Becouse Pvalue « & (.0397 < 05) it 'S statist coy siqnctioa, 
we Ware suclicicat evidence to conclude har hee 
he O \ou- dose aspera each q ay would reduce the 
Chovace of FOr.k ah Colon Concey a MOA OA\\ 
PLogle sim lar to tbe UGIUN FEES, . 


any part of this page is illegal. GO ON TO THE NEXT p AGE. 


-11- 


©2015 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


~ Yo) te 


4. A researcher conducted a medical study to investigate whether taking a low-dose aspirin reduces the chance 


of developing colon cancer. As part of the study, 1,000 adult volunteers were randomly assigned to one of two 


sroups. Half of the volunteers were assigned to the experimental group that took a low-dose aspirin each day, 
and the other half were assioned to the control group that took a placebo each day. At the end of six years, 

15 of the people who t - irin had developed colon cancer and 26 of the people who took 

the placebo had developed colon cancer. At the significance level a = 0.05, do the data provide convincing 
“Statistical evidence that taking a low-dose aspirin each day would reduce the chance of developing colon cancer 
among all people similar to the volunteers? 


ato. o? 
State : Dotoy mire ae Fakino, low-dose asorin. each dol woo 


roduee He chance of dave, Colo Cancer: IS ShobsHeat 
gigntheatt. =, 05 Ho: PXO P= Pr 


Ha AWG 0,< Pa - 
Dion: 71 Sample 2-test For proprtion IF oidHoS oxre (rok 
Tondam : volnker$ W2re Yandomly aSsigneot Yo 1 ovaps. 


namal: big enool Sample Sze = 1000 >So, 
- ihdeperdiat: 1000 «10 = 10,000 -MmeelS tne lO% Yule. 


Do’ 
— Conclude: We Can Cofciide Hot 


K,= 16 fhs datn 1§ Sfokskeally 

an Sonhcant becawse he O-Vald 

ae — Is les Han a=. (. 044.06). 

" O- @gar .02A So G2 Can conclude Har takwg 
OQ lav dd agin oR HAr 
ORK clay JOB reoluoe Hw darior of 
Qobting CoM COMCE 





any part of this page is illegal. GO ON TO THE NEXT PAGE. 


-11- 


©2015 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


AP® STATISTICS 
2015 SCORING COMMENTARY 


Question 4 
Overview 


The primary goal of this question was to assess a student's ability to identify, set up, perform, and interpret 
the results of an appropriate hypothesis test to address a particular question. More specific goals were to 
assess a student’s ability to (1) state appropriate hypotheses; (2) identify the appropriate statistical test 
procedure and check appropriate conditions for inference; (3) calculate the appropriate test statistic and 
p-value; and (4) draw an appropriate conclusion, with justification, in the context of the study. 


Sample: 4A 
Score: 4 


In step 1 correct hypotheses are given and the parameters are clearly defined as population values for each 
group. The phrase “proportion of people who develop colon cancer after taking low-dose aspirin each day” 
clearly defines one of the parameters. Step 1 was scored as essentially correct. In step 2 a correct procedure 
is identified by name and the formula is shown with the numbers substituted in. The response notes “given 
random assignment” and provides formulas and expected counts under the null hypothesis for the normality 
check. ‘The use of expected counts here rather than observed counts is acceptable. Though these formulas 
are in terms of population parameters rather than sample quantities, this was judged a minor error. As all 
three components are correct, step 2 was scored as essentially correct. In step 3 correct values of the test 
statistic and p-value are provided, so step 3 was scored as essentially correct. In step 4 linkage and a correct 


decision are provided by the statement “since p < a, we reject Hy,” anda correct conclusion in context is 


given. Step 4 was scored as essentially correct. Because all four steps were scored as essentially correct, the 
response earned a score of 4. 


Sample: 4B 
Score: 3 


Correct hypotheses are given in step 1. However, the parameters are incorrectly defined by stating the “true 
proportion of people who took medicine and get colon cancer.” Use of the word “true” appears to reference a 
population quantity, but use of the past tense (took rather than take) appears to reference sample quantities 
and not population values. In addition, the phrasing does not imply a conditional proportion (of those who 
could have taken aspirin, the proportion who get colon cancer). With one correct component and one 
incorrect component, step 1 was scored as partially correct. In step 2 the response identifies the name of the 
test, correctly addresses the random assignment condition, and provides all four observed counts, indicating 
that they are all greater than or equal to 10. As all three components are satisfied, step 2 was scored as 
essentially correct. In step 3 correct values of the test statistic and p-value are included, and step 3 was 
scored as essentially correct. In step 4 the comparison of the p-value to 0.05 provides linkage. The phrase “it 
is statistically significant” is ambiguous. The response has a correct conclusion in context. Although an 
explicit decision (such as reject H,) is not provided, it is not necessary when linkage and a correct 
conclusion in context are given. Step 4 was scored as essentially correct. Because three steps were scored 
as essentially correct, and one step was scored as partially correct, the response was initially scored as a 
3%. Looking holistically at the paper, the error in step 1 was viewed as serious, and the statement about 
statistically significant was ambiguous. Therefore, the response earned a score of 3. 
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Question 4 (continued) 


Sample: 4C 
Score: 2 


In step 1 the response states hypotheses in terms of common parameters but uses subscripts that are not 
clearly relevant to the context of the problem and are not defined in the response. As a result, it is not 
possible to determine whether or not the relationship between parameters given in the alternative 
hypothesis is correct. Therefore neither component of step 1 is satisfied, and step 1 was scored as incorrect. 
In step 2 the response identifies a correct procedure by name, correctly addresses the randomness condition, 
but provides an incorrect check of the normality condition (1,000 > 30). Because two of the three 


components are satisfied, step 2 was scored as partially correct. In step 3 the test statistic and p-value are 
correctly stated, and step 3 was scored as essentially correct. In step 4 the response provides linkage 
between the p-value and a = .05, and a conclusion is provided in context. But the conclusion is incorrect as 
it asserts definitively that taking a low dose of aspirin does reduce the chance of getting colon cancer. Step 4 
was scored as partially correct. Because one step was scored as essentially correct, two steps were scored as 
partially correct, and one step was scored as incorrect, the response earned a score of 2. 
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